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(54) Method of generating audio and/or video signals and apparatus therefore 



(57) The audio and/or video (a/v) generation appa- 
ratus is arranged to receive and record meta data asso- 
ciated with the a/v signals generated by a data proces- 
sor. The meta data may be user generated, such as text 
describing the a/v signals, or may be secondary meta 
data which is automatically generated by the a/v appa- 
ratus. The recording medium may be a linear recording 



medium, and the recording means may be arranged in 
operation to record at least one of the user generated 
meta data and the secondary meta data on the record- 
ing medium after the a/v signals at a position at which 
at least one of the user generated and secondary meta 
data can be read by a reproducing means before the a/v 
signals. 
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Description 

Fieid of the Invention 

[0001] The present invention relates to audio and/or 
video generation apparatus and methods of generating 
audio and/or video signals. The present invention also 
relates to methods of recording audio and/or video sig- 
nals. 

[0002] In preferred embodiments the audio and/or 
video generation apparatus is a video camera, cam- 
corder or television camera or the like. 

Background of the Invention 

[0003] The subject matter and content of audio and 
video productions varies greatly. In addition, to this va- 
riety there is, correspondingly, a considerable quantity 
of such audio and video productions. The quantity has 
been required to meet the demand from the broadcast 
and entertainment industries. The audio productions in- 
clude, for example, radio broadcasts, both live and pre- 
recorded, musical and audio recordings, whereas video 
productions include, for example, films, television pro- 
grams and video recordings. As will be appreciated typ- 
ically video productions also include an accompanying 
sound track or commentary, so that an audio production 
is inherently included as part of the video production. 
[0004] The term audio and/or video will be used here- 
in to refer to any from of audio information or signals, 
video information or signals, or a combination of video 
and audio information or signals. The term audio/video 
will be used for short to refer to audio and/or video. 
[0005] As a result of the great variety and considera- 
ble quantity of audio/video productions, the task of lo- 
cating particular content items of audio/video material 
within an archive of audio/video productions represents 
an arduous and labour intensive task, because an op- 
erator must visually search the audio/video productions 
for the desired content item. Furthermore, because of 
the length of audio/video productions which are typically 
although not exclusively stored on linear recording me- 
dia, the task of navigating through the media to locate 
particular content items of audio/video material from an 
audio/video production is time consuming and labour in- 
tensive. 

[0006] In our co-pending UK patent application 
number GB 9921 235.9 there is disclosed a method and 
apparatus for navigating through the content of audio/ 
video material using metadata which represents the 
content of the audio/video material. 
[0007] The term metadata as used herein refers to 
and includes any form of information or data which 
serves to describe either the content of audio/video ma- 
terial or parameters present or used to generate the au- 
dio/video material or any other information associated 
with the audio/video material. Metadata may be, for ex- 
ample, "semantic metadata" which provides contextual/ 



descriptive information about the actual content of the 
audio/video material. Examples .of semantic meta data 
are the start of periods of dialogue, changes in a scene, 
introduction of new faces or face positions within a 

5 scene or any other items associated with the source 
content of the audio/video material. The metadata may 
also be syntactic metadata which is associated with 
items of equipment or parameters which were used 
whilst generating the audio/video material such as, for 

io example, an amount of zoom applied to a camera lens, 
an aperture and shutter speed setting of the lens, and 
a time and date when the audio/video material was gen- 
erated. Although meta data may be recorded with the 
audio/video material with which it is associated, either 

15 on separate parts of a recording medium or on common 
parts of a recording medium, meta data in the sense 
used herein is intended for use in navigating and iden- 
tifying features and essence of the content of the audio/ 
video material, and may, therefore be separated from 

20 the audio/video signals when the audio/video signals 
are reproduced. The meta data is therefore separable 
from the audio/video signals. 

[0008] The apparatus and method for navigating 
through the content of audio/video material disclosed in 
25 the co-pending UK patent application number GB 
9921235.9 Uses the meta data which has been gener- 
ated with the audio/video signals to navigate through the 
items of contextual or essence information of the audio/ 
video material. 

30 [0009] In a further co-pending UK patent application 
number 9921234.2 there is disclosed an editing system 
for editing source content such as audio/video material 
to produce an edited audio/video production by applying 
a template representative of a desired production style 

35 to meta data associated with the audio/video material to 
form the production. Again the meta data must be gen- 
erated with the audio/video material in order for the ed- 
iting system to form the audio/video production. 

40 Summary of the Invention 

[001 0] According to the present invention there is pro- 
vided an audio and/or video generation apparatus which 
is arranged in operation to generate audio and/or video 

45 signals representative of an audio and/or video source 
having a meta data generation tool which is arranged in 
operation to generate meta data associated with said 
audio and/or video signals, wherein the type of meta da- 
ta generated by said meta data generation tool is user 

50 selectable. 

[001 1] As discussed above there is a great variety in 
the nature and content of audio/video productions. The 
audio/video productions are created from audio/video 
material generated using, for example, video cameras, 

55 television cameras as well as computer generating 
graphics and animation processors. As such because 
of the variety in the nature and content of audio/video 
productions, the type and nature of metadata which is 
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to be associated with the audio/video material corre- 
spondingly differs. Furthermore although it has been 
known to associate metadata with audio/video produc- 
tions for facilitating asset management for archiving the 
audio/video productions, as indicated in our co-pending 5 
patent applications mentioned above, is proposed to 
use metadata for facilitating the creation of the audio/ 
video productions by editing and navigating through the 
content of the audio/video material. An improvement in 
the creation of audio/video productions is therefore 
gained by providing a metadata generation tool which 
has a facility for allowing a user to select a type of meta- 
data which is generated by the too! in association with 
the audio/video material produced. In this way the meta- 
data can be adapted to the nature and subject of the 
audio/video production to which the audio/video signal 
source content is to be applied. 

[001 2] To facilitate selection of the metadata to be as- 
sociated with the audio/video signals the metadata gen- 
eration tool may be arranged to generate metadata of a 
plurality of different types and the metadata generation 
tool may be provided with a user interface for selecting 
at least one of the plurality of different types of metadata, 
the metadata being generated in operation from the au- 
dio/video signals in accordance with the user selected 
metadata types. 

[001 3] Thus, whilst it is conceivable that the audio/vid- 
eo generation apparatus may be provided with a facility 
for generating any type of metadata, in a preferred em- 
bodiment the metadata generated by the audio/video 
generation apparatus may be of a predetermined plu- 
rality of types, which may be may selected by the user 
using the user interface. Furthermore in order to ensure 
that at least some metadata is generated with the audio/ 
video signals generated by the apparatus in case the 
user neglects to select a metadata type, the metadata 
generation tool may be arranged to generate metadata 
of a default type in absence of a user selecting from the 
plurality of metadata types. 

[001 4] Advantageously the meta data generation tool 
may be arranged to generate meta data in accordance 
with a plurality of pre-determined meta data selections, 
each of which specifies at least one of the plurality of 
different meta datatypes arranged in a pre-determined 
format, the user interface being arranged to provide the 
user with a facility for selecting one of the meta data 
selections for generating the meta data. The plurality of 
different meta data selections may be arranged in ac- 
cordance with a standard defining the type and format 
of metadata associated with the audio/video signals. 
[001 5] In order to provide a unified form of metadata 
which is common within the audio/video production in- 
dustry, standards are being developed for metadata for 
example by SMPTE-EBU (Society of Motion Picture and 
Television Engineers-European Broadcast Union) and 
by MPEG-7 (Motion Picture Expert Group, which is an 
ISO-IEC standards body SG29/WG1 1). As such by pro- 
viding the metadata generation tool with a facility for se- 



lecting one of a plurality of pre-defined meta data selec- 
tions, the selections being defined in accordance with a 
standard such as for example that proposed by SM PTE- 
EBU then the user may select an appropriate standard 
for the type of audio/video material being generated by 
the audio/video generation apparatus. 
[001 6] Although the audio/ video signals generated by 
the apparatus may be broadcast directly with the meta- 
data from a remote location, advantageously the audio 
and/or video generation apparatus may further com- 
prise a recording/reproducing apparatus which is ar-. 
ranged in operation to record the audio and/or visual in- 
formation signals onto a recordable medium with the 
metadata. 

[0017] In a preferred embodiment the audio and/or 
video generation apparatus may be a video camera, 
camcorder, television camera, cinema camera or the 
like. 

[001 8] According to an aspect of the present invention 
there is provided a meta data generation tool for use in 
generating metadata in association with an audio and/ 
or visual generation apparatus, the tool being arranged 
in operation to generate meta data associated with au- 
dio and/or video signals, wherein the type of meta data 
generated by said tool is selectable. 
[001 9] According to a further aspect of the present in- 
vention there is provided a method of generating audio 
and/or video information signals, comprising the steps 
of identifying the nature of the audio and/or video infor- 
mation signals to be generated, selecting appropriate 
types of meta data to be generated with the audio and/ 
or video signals, and generating the meta data of the 
selected type in association with the audio and/or video 
information signals. 

[0020] According to a further aspect of the present in- 
vention there is provided an audio and/or video gener- 
ation apparatus which is arranged in operation to gen- 
erate audio and/or video signals representative of an au- 
dio and/or video source, the audio and/or video gener- 
ation apparatus comprising a recording means which is 
arranged in operation to record the audio and/or video 
signals on a recording medium, wherein the audio and/ 
or video generation apparatus is arranged to receive 
metadata associated with the audio and/or video signals 
generated by processor, the recording means being ar- 
ranged in operation to record the metadata on the re- 
cording medium with the audio and/or video signals. 
[0021] An audio/video generation apparatus which is 
arranged to receive metadata generated by a data proc- 
essor is provided with an improved facility for introduc- 
ing metadata associated with audio/video signals gen- 
erated by the audio/video apparatus. 
[0022] Furthermore, the audio/video generation ap- 
paratus may be provided with a user interface having a 
predetermined format for connecting the audio and/or 
video generation apparatus to the data processor. The 
interface therefore provides a facility for the data proc- 
essor to be connected to the audio and/or video gener- 
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at ion apparatus using the interface. The predetermined 
format may be of a common type thereby providing a 
facility for a range of possible data processors to be con- 
nected to the audio/video generation apparatus. As 
such, the data processor provides a facility for a user to 
generate metadata and for including this metadata with 
the audio and/or video signals generated by the audio/ 
video generation apparatus. The metadata may be re- 
corded separately on the recording medium, from the 
audio and/or video signals. 

[0023] A further improvement is provided to the audio 
and/or video generation apparatus by providing a meta- 
data generator coupled to the recording means, the 
metadata generator being arranged in operation to gen- 
erate secondary metadata associated with the audio 
and/or video signals, wherein the user generated meta- 
data and the secondary metadata are arranged to be 
recorded in combination on the recording medium. As 
such the metadata generator which is embodied within 
the audio and/or video generation apparatus provides a 
facility for automatically generating the secondary meta- 
data which is therefore associated with the audio/video 
signals. 

[0024] The secondary metadata may be semantic 
metadata representative of operating parameters of the 
audio and/or video generation apparatus when the au- 
dio and/or video signals are generated. 
[0025] In a preferred embodiment the user generated 
metadata may be text information. Furthermore the sec- 
ondary metadata may be stored in a data store and com- 
bined with the user generated metadata by a control 
processor and recorded onto the recording medium by 
the recording means. 

[0026] The data processor may have a speech con- 
version processor which is arranged in operation to gen- 
erate the text information by converting sound signals 
representative of spoken communication by the user in- 
to the text information. This provides a further improve- 
ment in facilitating the generation of the user generated 
meta data in that although the data processor may be 
provided with an alpha numeric key pad for generating 
the user generated metadata, by providing a speech 
conversion processor, the user may speak the desired 
meta data to be associated with the audio/video material 
from which the metadata is generated and recorded on 
the recording medium. In one embodiment therefore the 
data processor may be a personal computer. In another 
embodiment, the data processor may be a personal dig- 
ital assistant such as a PSION (TM) or a PALM PILOT 
(TM) or the like. In other embodiments the audio and/or 
video generation apparatus may be a video camera, a 
television camera or a camcorder or the like. 
[0027] The recording medium may be a linear record- 
ing medium and the recording means may be arranged 
in operation to record at least one of the user generated 
meta data and the secondary metadata on the recording 
medium after the audio and/or video signals at a position 
at which the at least one of the user generated metadata 



and the secondary metadata can be read by a reproduc- 
ing means before the audio and/or video signals. 
[0028] According to a further aspect of the present in- 
vention there is provided a method of recording audio 

5 and/or video signals comprising the steps of recording 
the audio and/or video signals on a recording medium, 
generating automatically meta data in response to said 
audio and/or video signals, providing user generated 
meta data associated with said audio and/or video sig- 

10 nals, and recording said meta data on said recording 
medium with said audio and/or video signals. 
[0029] According to a further aspect of the present in- 
vention there is provided an audio and/or video gener- 
ation apparatus which is arranged in operation to gen- 

15 erate audio and/or video signals representative of an au- 
dio and/or visual source, the audio and/or video appa- 
ratus comprising a recording means which is arranged 
in operation to record the audio and/or video signals on 
a recording medium, wherein the recording medium is 

20 a linear recording medium, and the recording means is 
arranged in operation to record metadata associated 
with the audio and/or video signals on the recording me- 
dium after the audio and/or video signals at a position 
at which the metadata can be read by a reproducing 

25 means before the audio and/or video signals. 

[0030] According to a further aspect of the present in- 
vention there is provided a method of recording audio 
and/or video material comprising the steps of generating 
audio and/or video signals representative of audio and/ 

30 or video material from an audio and/or visual source, 
recording the audio and/or video signals on a linear re- 
cording medium and recording metadata associated 
with the audio and/or video signals on the recording me- 
dium after the audio and/or video signals at a position 

35 at which the metadata can be read by a reproducing 
means before the audio and/or video signals. 
[0031 ] Further aspects and features of the present in- 
vention are defined in the appended claims. 

40 Brief Description of the Drawings 

[0032] Embodiments of the present invention will now 
be described by way of example with reference to the 
accompanying drawings wherein: 

45 

Figure 1 is a schematic block diagram of a video 
camera operatively coupled to a personal digital as- 
sistant, 

Figure 2 is a schematic block diagram of the body 
so of the video camera shown in figure 1 , 

Figure 3 is an example representation of informa- 
tion recorded onto a magnetic tape. 
Figure 4 is a flow diagram representing a method 
for recording audio/video material, 
55 Figure 5 is a schematic block diagram of an ar- 
rangement in which a camcorder is connected to a 
personal computer, 

Figure 6 is a schematic block diagram of the body 
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of the camcorder shown in figure 3, 

Figure 7 is a representation of an image formed by 

the camcorder, 

Figure 8 is a schematic block diagram of a video 
camera embodying a meta data generation tool, 
Figure 9 is a schematic block diagram of a meta da- 
ta generation tool shown in Figure 8, and 
Figure 1 0 is a table representing examples of pre- 
defined meta data selections. 

Description of Preferred Embodiments 

[0033] As will be understood from the introduction and 
discussion above, embodiments of the present inven- 
tion relate to audio and/or video generation apparatus 
which may be for example television cameras, video 
cameras or camcorders. As a first illustration of an em- 
bodiment of the present invention, figure 1 provides a 
schematic block diagram of a video camera which is ar- 
ranged to be connected to a personal digital assistant. 
A personal digital assistant is an example of a data proc- 
essor which may be arranged in operation to generate 
metadata in accordance with a user's requirements. The 
term personal digital assistant is known to those ac- 
quainted with the technical field of consumer electronics 
as a portable or hand held personal organiser or data 
processor including an alpha numeric key pad and may 
also include a hand writing interface. In figure 1 a video 
camera 1 is shown to comprise a camera body 2 which 
is arranged to receive light from an image source falling 
within a field of view of an imaging arrangement 4 which 
may include one or more imaging lenses (not shown). 
The camera also includes a view finder 6 and-an oper- 
ating control unit 8 from which a user can control the 
recording of signals representative of the images 
formed within the field of view of the camera. The cam- 
era 1 also includes a microphone 10 which may be a 
plurality of microphones arranged to record sound in 
stereo. Also shown in figure 1 is hand personal digital 
assistant 1 2 which has a screen 1 4 and an alphanumer- 
ic key pad 1 6 which also includes a portion to allow the 
user to write characters recognised by the personal dig- 
ital assistant. The personal digital assistant 12 is ar- 
ranged. to be connected to the video camera 1 via an 
interface 18. The interface 1 8 is arranged in accordance 
with a predetermined standard format such as, for ex- 
ample an RS232 or the like. The interface 18 provides 
a facility for receiving metadata generated using the per- 
sonal digital assistant 12 which can be recorded with the 
audio and video signals detected and captured by the 
video camera 1 . A better understanding of the operation 
of the video camera 1 in combination with the personal 
digital assistant 1 2 may be gathered from figure 2 which 
shows a more detailed representation of the body of the 
video camera 2 which is shown in figure 1 and in which 
common parts have the same numerical designations. 
[0034] In figure 2 the camera body 2 is shown to com- 
prise a tape drive 22 having read/write heads 24 oper- 



atively associated with a magnetic recording tape 26. 
Also shown in figure 2 the camera body includes a con- 
trol processor 28 coupled to the tape drive 22 via a con- 
necting channel 30. Also connected to the control proc- 
5 essor 28 is a data store 32 and a secondary metadata 
generator 34. The secondary metadata generator 34 is 
connected to a clock 36 and three sensors 38, 40, 42. 
The interface unit 1 8 is also shown in figure 2 to receive 
first and second connecting channels 44 and 46. Corre- 
10 spondingly two connecting channels connect the inter- 
face unit 1 8 to the control processor 28 via correspond-, 
ing connecting channels 48 and 50. 
[0035] The video camera 1 shown in figure 1 operates 
to record visual information falling within the field of view 
15 of the lenses arrangement 4 onto a recording medium. 
The visual information is converted by the camera into 
video signals. In combination, the visual images are re- 
corded as video signals with accompanying sound 
which is detected by the microphone 10 and arranged 
to be recorded as audio signals on the recording medi- 
um with the video signals. As shown in figure 2, the re- 
cording medium is a magnetic tape 26 which is arranged 
to record the audio and video signals onto the recording 
tape 26 by the read/write heads 24. The arrangement 
by which the video signals and the audio signals are re- 
corded by the read/write heads 24 onto the magnetic 
tape 26 is not shown in figure 2 and will not be further 
described as this does not provide any greater illustra- 
tion of the example embodiment of the present inven- 
tion. However once a user has captured visual images 
and recorded these images using the magnetic tape 26 
as with the accompanying audio signals, metadata de- 
scribing the content of the audio/video signals may be 
input using the personal digital assistant 12. As shown 
in figure 2 the interface unit 1 8 provides a facility where- 
by the metadata added by the user using the personal 
digital assistant 12 may be received within the camera 
body 2. Data signals representative of the user gener- 
ated metadata are received via the connecting channels 
44, 46 at the interface unit 18. The interface unit 18 
serves to convert these signals into a form in which they 
can be processed by the control processor 28 which re- 
ceives these data signals via the connecting channels 
48,50. 

[0036] Secondary metadata is also generated by the 
secondary metadata generator 34. In the example em- 
bodiment illustrated in figure 2, the secondary metadata 
includes time codes which are generated with reference 
to the clock 36. The secondary metadata may also in- 
clude the aperture setting of the camera lens 4, the shut- 
ter speed 40 and a signal received via the control unit 8 
to indicate that the visual images captured are a "good 
shot". These signals and data are generated by the sen- 
sors 38, 40, 42 and received at the secondary metadata 
processor 34. In combination therefore in the example 
embodiment the secondary metadata generate 34 is ar- 
ranged to produce syntactic metadata which defines 
and provides operating parameters which are used by 
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the camera in generating the video signals. The second- 
ary or syntactic metadata is therefore also fed to the con- 
trol processor 28. 

[0037] In the example embodiment shown in figures 
1 and 2, the syntactic metadata is generated automati- 
cally at the time and during the generation of the video 
signals. However a remaining technical problem is how 
to record the secondary metadata on the magnetic tape 
26. In the example embodiment shown in figure 2. the 
control processor 28 is provided with the data store 32. 
Therefore during the process of capturing the video sig- 
nals, the control processor 28 stores the secondary 
metadata in the data store 32. At the time at which the 
user has finished recording the video signals which 
serve to capture a particular event, then the user may 
generate metadata using the personal digital assistant 
12. Once the user generated metadata is received with- 
in the control processor 28, the user generated metada- 
ta is combined with the secondary metadata stored in 
the data store 32 and written onto the magnetic tape 26 
using the read/write heads 24 by feeding appropriate 
signals via the connecting channel 30. In combination 
therefore a user is provided with a convenient and im- 
proved facility for adding metadata to the audio/video 
material generated by the video camera 1 . 
[0038] Examples of automatically generated second- 
ary metadata are good shot markers, Rec marks (indi- 
cating where recording starts/stops), GPS location, 
Time and Date, UMID (Unique Material Identifier), Cam- 
era settings, focus, zoom, etc. Examples of metadata 
added by the user manually associated with the source 
content after it has been created, are for example Cam- 
eraman, Journalist Notes, Style comments, Sugges- 
tions, Annotations, Location/Assignment comments, 
Shot identification such as Intro, Finish, Commentary, 
Voice Over, etc. In addition, there is much development 
in progress directed to the automatic extraction of meta- 
data from the content, for example by using recognition 
tools, for example face and feature detection tools, 
speech recognition tools etc., to identify features within 
the content, and thereby enable appropriate metadata 
to be added. 

[0039] A further advantageous feature of the example 
embodiment of the present invention is illustrated with 
reference to figure 3. In figure 3 an illustration is shown 
of a section of the magnetic tape 26 upon which the au- 
dio/video signals and the metadata have been recorded. 
As shown in figured, the audio/video signals are record- 
ed first in a first portion 54 whereas the combined user 
generated and secondary metadata is recorded onto a 
later.portion of the magnetic tape 56. Furthermore the 
control processor operates in combination with the re- 
cording heads 24 to record the user generated and sec- 
ondary metadata at a position at which the metadata will 
be read first by a reproducing or video scanning or play- 
ing apparatus before the audio/video signals recorded 
in the portion 54. This provides a particular advantage 
in that an editor is provided with a rapid indication of the 



10 

content of the audio/video source information present 
on the tape 56. — * ■ 

[0040] The process by which metadata is produced 
using the personal digital assistant 12 and combined 
5 with the secondary metadata to form part of the record- 
ing of the audio/video signals is illustrated with refer- 
ence to a flow diagram shown in figure 4. In figure 4 a 
first step in the operation of the process is that the video 
camera 1 generates audio/video signals representative 
10 of the visual images captured by the camera and the 
accompanying audio sound track. This is represented 
by process step 60. After the audio/video signals have 
been generated, the user specifies metadata which is 
to be added to the audio video signals and generates 
15 this user generated metadata using the personal digital 
assistant 12. This is performed at the processor step 62. 
During the capture of the audio/video signals, the cam- 
era 1 operates to automatically generate the secondary 
metadata which is associated with the audio/video sig- 
20 nals and this is performed by process step 64. The sec- 
ondary metadata and the user generated metadata are 
then combined in a process step 66 and in the final proc- 
ess step 68, the combined semantic metadata and user 
generated metadata are recorded onto the magnetic 
25 tape 26. 

[0041] A second embodiment of the present invention 
is shown in figure 5 in which a camcorder 70 is connect- 
ed to a personal computer 72 . The camcorder 70 is con- 
nected to the personal computer 72 via a connecting 

30 channel 74 and an interface unit 76. As with the first em- 
bodiment, the interface unit 76 may be arranged in ac- 
cordance with a predetermined format, and may corre- 
spond to a standardised interface thereby allowing a fa- 
cility for the camcorder 70 to be connected to any form 

35 of data processor of which the personal computer 72 is 
just one example. As with the first embodiment, the in- 
terface unit 76 serves to provide a facility by which meta- 
data may be introduced into the camcorder 70 to be re- 
corded with audio/video signals generated by the cam- 

40 corder. However, also shown in figure 5 is a microphone 
78. In the second embodiment shown in figure 5, the 
personal computer 72 is arranged to operate a speech 
conversion or dictation application program which pro- 
vides a facility for converting words spoken into the mi- 

45 crophone 78 by the user into text. In this way the user 
may speak desired metadata into the microphone 78 
which may be converted into the text metadata, and fed 
via the connecting channel 74 to the interlace unit 76. 
The metadata is then recorded onto the recording me- 

50 dium used by the camcorder as shown in more detail in 
figure 6. 

[0042] Figure 6 provides a more detailed block dia- 
gram of the camcorder body 75 which is shown in figure 
5. The camcorder body 75 is shown to comprise an im- 
55 age processing unit 82 which is arranged to convert im- 
ages formed by the imaging lens 71 into video signals 
representative of the images and also to combine these 
video images with audio signals detected by a micro- 
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phone of the camcorder (not shown). These audio/video 
signals are generated by the processor 82 and fed to a 
tape driving processor 84 which includes recording 
heads and operates to record the audio/video signals 
onto a magnetic tape 86, Also shown in figure 6 is the 
interface unit 76 which is connected to two connecting 
conductors which form the connection channel 74. The 
interface unit 76 is correspondingly connected to a 
metadata control processor 88 via two conductors 90, 
92. The metadata controller 88 is connected to the tape 
dive controller 84 via a connecting channel 94. The cam- 
corder is also provided with a user interface through 
which commands may be input. A user interface 96 is 
shown in figure 6 to have five control buttons 98 thor- 
ough which commands may be input. One of these op- 
erating buttons 98 is connected to a text building unit 
1 00 which is also connected to the image processor 82. 
The text building unit 1 00 also receives signals gener- 
ated from a jog shuttle input device 1 02 via a connecting 
channel 103. The text building unit 100 is triggered in a 
conventional camcorder by one of the operating buttons 
98 forming the user interface unit 96. This provides a 
facility through which a user may generate text such as 
a title or summary of the images captured by the cam- 
corder. The letters of each word are selected using the 
jog-shuttle device 102 in order to introduce the desired 
text. An example of this is shown in figure 7 which pro- 
vides an illustrative representation of the image formed 
by the camcorder and which would be viewed in the view 
finder 73. Here the title "Christmas 1999" is shown su- 
perimposed on the images captured within the field of 
view of the camcorder lens 71. The title "Christmas 
1 999" shown at a central position 1 04 within the screen 
is therefore superimposed with the images captured or 
shot by the camcorder 70 and recorded with these im- 
ages onto the magnetic tape 86. Similarly as shown in 
figure 7, conventional camcorders have a facility for 
adding the date/time 1 06 to the images captured by the 
camcorder, which form part of the image and therefore 
the video signals. In contrast however in accordance 
with the second embodiment of the present invention 
: the text generated from the personal computer 72 and 
received at the interface unit 76 is processed by the 
metadata processor 88 and if appropriate stored until 
an appropriate command is triggered after the desired 
images represented by the audio/video signals have 
been recorded onto the magnetic tape 86. After the au- 
dio/video signals have been recorded, the metadata is 
fed from the metadata processor 88 to the tape driving 
processor 84 via the connecting channel 94 and are re- 
corded onto the magnetic tape 86 after the audio/video 
signals. As such in preferred embodiments the metada- 
ta is recorded at the first position on the magnetic tape 
which would be read by a playing or editing unit so that 
an editor is provided with improved knowledge of the 
content of the magnetic tape 86. Thus the text generated 
as metadata is distinguished from the textual informa- 
tion of the summary or title of the audio/video signals 



104 or the date/time 1 06 generated by the text building 
unit 100 in that the summary^ title or the date/time. 106 
are recorded with the audio video/signals and as such 
form part of these audio/video signals. The user gener- 

5 ated metadata in contrast is separable from the audio/ 
video signals and so is recorded separate from and dis- 
tinct to the title, summary and date/time of the images. 
[0043] A third example embodiment of the present in- 
vention is shown in figure 8 embodied within a video 

10 camera 1 which corresponds substantially to the video 
camera which is shown in figure 1 and so the parts of - 
the video camera 1 shown in figure 8 which are common 
to figure 1 bear the same numerical designations. For 
brevity only the differences between the video camera 

15 shown in figure 8 and the video camera shown in figure 
1 will be described. The video camera shown in figure 
8 is provided with a metadata generation tool 110 which 
forms part of the video camera 1 . However as will be 
appreciated the metadata generation tool 110 could be 

20 formed separately from the video camera 1 and con- 
nected to the video camera 1 via an appropriate inter- 
face. Equally however the metadata generation tool 
could be connected to any other form of audio and/or 
video generation apparatus such as a camcorder or 

25 computer graphics tool or the like. The metadata gen- 
eration tool is provided with a user interface 112 having 
a screen 114 and a keypad 1 1 6. The keypad provides a 
means via which the user may enter commands and text 
which are received by a user interface data processor 

30 which will be described shortly. A more detailed diagram 
of the metadata generation tool shown in figure 8 is pro- 
vided by a schematic block diagram shown in figure 9. 
In figure 9 the metadata generation tool 1 1 0 is shown to 
have a user interface data processor 118 which is oper- 
as atively coupled to the screen 1 1 4 and the keypad 1 1 6 of 
the user interface 112 of the metadata generation tool 
although this is not shown in f igure 9. The metadata gen- 
eration tool 11 0 is shown in figure 9 to have a metadata 
processor 1 20 coupled to four metadata generation sen- 

^o sors 122, 124, 126, 128. The user interface data proc- 
essor 118 is connected to the metadata processor 120 
by a control channel 130 and a metadata channel 132. 
Also connected to the user interface data processor 1 1 8 
is a data store 134 via a connecting channel 136. 

45 [0044] In operation the metadata processor operates 
to receive signals from the metadata sensors 122, 124, 
126, 128 which are representative of values of param- 
eters which are identified as metadata and which are 
generated in association with the audio/video signals 

50 generated by for example the video camera 1 . The user 
interface data processor 1 1 8 is arranged to receive com- 
mands from the user via the keypad 116 which indicate 
which of a plurality of types of metadata the user wishes 
to generate with the audio/video signals which are re- 

55 corded in association with the audio/video signals by the 
video camera 1 . An example of possible metadata types 
is shown in a table in figure 10. In figure 10 the first row 
provides an indication of six possible metadata types 
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which are "Time" which is the time at which the audio/ 
video signals are generated, "Date" which is the date at 
which the audio/video signals are generated, a location 
at which the audio/video signals are generated as pro- 
vided by a "GPS" (Global Positioning System) data val- 
ue, "F Stop" which is representative of the aperture set- 
ting of the video camera when the audio/video signals 
were generated, "Rec" which is representative of a point 
at which recording starts and stops, and finally "Text" 
which is a text value input by the user via the keypad 
116 which the user is free to provide and which is indic- 
ative of some value that the user considers important. 
[0045] Whilst the user is free to select any of the meta- 
data types available to the video camera, as shown in 
figure 1 0 the metadata types are divided into three meta- 
data selections. Each of these selections is represented 
as a corresponding row of the table and indicated as 
selection 1 SEL1 , selection 2 SEL2, selection 3 SEL3 
and finally a default setting DEFLT The type of metadata 
generated for each of the selections is indicated by a 
corresponding cross X in the table cell below the column 
which indicates the metadata type. So for example for 
selection 1 SEL1 the time, date and text will be gener- 
ated as metadata. The user is arranged to select one of 
the selections by inputting appropriate commands, via 
the keypad 116 and the user's desired selection is rep- 
resented on the screen 114. The desired selection is 
processed by the user interface data processor 118 
which further operates to communicate control signals 
representative of this desired selection to the metadata 
processor 120. The metadata processor 120 is there- 
fore correspondingly configured to generate the meta- 
data values for each of the metadata types from the sig- 
nals received from the metadata sensors 1 22, 1 24 , 1 26, 
1 28. The metadata processor 1 20 then operates to gen- 
erate signals representative of the selected values 
which are output by the metadata generation tool 11 0 at 
an output channel 1 40 to be associated with the audio/ 
video signals. If the user selects a selection which allows 
for text to be added as metadata, then the text may be 
prompted by the user interface data processor 118 
which may be input via the keypad 116 and stored in the 
data store 134. The data store 134 may also serve to 
store the metadata selection the user has selected. Sig- 
nals representative of a user input text are then fed via 
the metadata channel 132 from the user interface data 
processor 118 to the metadata processor 120. 
[0046] Whilst the range of different types of metadata 
may be very large and the format in which the metadata 
is generated may also vary greatly, the SMPTE-EBU is 
currently investigating and standardising different types 
and formats for metadata. As such one or more of the 
categories which may be available to the user via the 
user interface may correspond to one of the plurality of 
predetermined standards. In this way the user may se- 
lect the metadata standard which is most appropriate 
for the type of audio/video signals which are being gen- 
erated and will be recorded to represent the content of 
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the audio/video source. The metadata standard is used 
to produce and to facilitate editing-of the audio/video sig- 
nals to produce an audio/video production. 
[0047] As will be appreciated by those skilled in the 
5 art various modifications may be made to the embodi- 
ments herein before described without departing from 
the scope of the present invention. For example whilst 
embodiments have been described with recording au- 
dio/video onto magnetic tape, it will be appreciated that 
other recording media are possible. Furthermore al- 
though the user generated metadata has been repre- 
sented as text information, it will be appreciated that any 
other forms of metadata may be generated either auto- 
matically or under control of the user and received within 
the audio and/or video generation apparatus via an in- 
terface unit. Correspondingly the secondary metadata 
may be any form of semantic or syntactic metadata. 
[0048] As will be appreciated those features of the in- 
vention which appear in the example embodiments as 
a data processor or processing units could be imple- 
mented in hard ware as well as a software computer pro- 
gram running on an appropriate data processor. Corre- 
spondingly those aspects and features of the invention 
which are described as computer or application pro- 
grams running on a data processor may be implement- 
ed as dedicated hardware. It will therefore be appreci- 
ated that a computer program running on a data proc- 
essor which serves to form an audio and/or video gen- 
eration apparatus as herein before described is an as- 
pect of the present invention. Similarly a computer pro- 
gram recorded onto a recordable medium which serves 
to define the- method according to the present invention 
or when loaded onto a computer forms an apparatus ac- 
cording to the present invention are aspects of the 
present invention. 



Claims 

1 . An audio and/or video generation apparatus which 
is arranged in operation to generate audio and/or 
video signals representative of an audio and/or vid- 
eo source having a meta data generation tool which 
is arranged in operation to generate meta data as- 
sociated with said audio and/or video signals, 
wherein the type of meta data generated by said 
meta data generation tool is user selectable. 

2. An audio and/or video generation apparatus as 
claimed in Claim 1 , wherein said meta data gener- 
ation tool is arranged to generate meta data of a 
plurality of different types, and said meta data gen- 
eration tool is provided with a user interface for se- 
lecting at least one of the plurality of different types 
of meta data, said meta data being generated in op- 
eration in accordance with the content of said audio 
and/or video signals in accordance with said user 
selected meta data types. 
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3. An audio and/or video generation apparatus as 
claimed in Claims 1 or 2, wherein said meta data 
generation tool is arranged to generate meta data 
of a default type in absence of a user selecting from 
said plurality of meta data types. 

4. An audio and/or video generation apparatus as 
claimed in Claims 1 , 2 or 3, wherein said meta data 
generation tool is arranged to generate meta data 
in accordance with a plurality of p re-determined me- 
ta data selections, each of which specifies at least 
one of said plurality of different meta data types ar- 
ranged in a pre-determined format, said user inter- 
face being arranged to provide the user with a facil- 
ity for selecting one of the meta data selections for 
generating said meta data. 

5. An audio and/or video generation apparatus as 
claimed in Claim 4, wherein each of said plurality of 
meta data selections is arranged in accordance with 
a standard defining the type and format of meta data 
to be generated with said audio and/or video infor- 
mation. 

6. An audio and/or video generation apparatus as 
claimed in any preceding Claim, comprising a re- 
cording/reproducing apparatus which is arranged in 
operation to record said audio and/or video signals 
with said meta data on a recordable medium, in 
form in which said meta data may be recovered sep- 
arately from said audio/video signals. 

7. An audio and/or video generation apparatus as 
claimed in any preceding Claim, wherein said audio 
and/or video generation apparatus is a video cam- 
era, camcorder, television camera, cinema camera 
or the like. 

8. A meta data generation tool for use in generating 
metadata in association with an audio and/or visual 
generation apparatus, said generation tool being 
arranged in operation to generate meta data asso- 
ciated with audio and/or video signals, wherein the 
type of meta data generated by said tool is user se- 
lectable. 

9. A meta data generation tool as claimed in Claim 8, 
comprising a user interface which is arranged to 
provide a facility for selecting at least one of a plu- 
rality of different types of meta data, said meta data 
being generated from said audio and/or video sig- 
nals in accordance with said user selected meta da- 
ta types. 

10. A meta data generation tool as claimed in Claims 8 
or 9, arranged in operation to generate meta data 
of a default type in absence of a user selecting from 
said plurality of meta data types. 
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11. A meta data generation tool as claimed in any of 
Claims 8, 9 or 10, wherein-said tool is arranged to 
generate meta data in accordance with a plurality 
of pre-determined meta data selections, each of 

5 which has at least one of said plurality of meta data 
types arranged in a pre-determined format, said us- 
er interface being arranged to provide the user with 
a facility for selecting one of said meta data selec- 
tions for generating said meta data. 

10 

1 2. A meta data generation tool as claimed in Claim 11/ 
wherein each of said plurality of meta data selec- 
tions is arranged in accordance with a standard de- 
fining the type and format of meta data associated 

15 with said audio and/or video information. 

1 3. A method of generating audio and/or video informa- 
tion signals, comprising the steps of 

20 - identifying the nature of the audio and/or video 
information signals to be generated, 
selecting appropriate types of meta data to be 
generated with the audio and/or video signals, 
and 

25 - generating the meta data of the selected type 
in association with the audio and/or video infor- 
mation signals. 

14. A method as claimed in Claim 13, wherein the step 
30 of selecting appropriate types of meta data compris- 
es the steps of 

providing a plurality of different selectable meta 
data types, and 
35 - selecting at least one of the plurality of different 
meta data types, said meta data being gener- 
ated in operation from said audio and/or video 
signals in accordance with said user selected 
meta data types. 

40 

15. A method as claimed in Claim 14, wherein the step 
of selecting at least one of the plurality of different 
meta data types comprises the step of 

45 . providing a default selection having at least one 
meta data type in absence of a user selecting 
from said plurality of meta data types. 

16. A method as claimed in any of Claims 14 or 15, 
50 wherein the step of selecting at least one of the plu- 
rality of meta data types, comprises the steps of 

providing a plurality of pre-defined meta data 
selections, each of which selections specifies 
55 at least one of said plurality of meta data types 

arranged in a predetermined format, and 
selecting one of said pre-defined meta data se- 
lections. 
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17. A method as claimed in Claim 16, wherein each of 
said plurality of meta data selections is arranged in 
accordance with a standard defining the type and 
format of meta data associated with said audio and/ 
or video information signals. 

18. A computer program providing computer executa- 
ble instructions, which when loaded on to a compu- 
ter configures said computer to operate as an audio 
and/or video generation apparatus as claimed in 
any of Claims 1 to 7. 

19. A computer program providing computer executa- 
ble instructions, which when loaded on to a compu- 
ter configures said computer to operate as a meta 
data generation tool as claimed in any of Claims 8 
to 12. 

20. A computer program having computer executable 
instructions, which when loaded on to a computer 
causes the computer to perform the method accord- 
ing to any of Claims 13 to 17. 

21. A computer program product having a computer 
readable medium having recorded thereon informa- 
tion signals representative of the computer program 
Claimed in any of Claims 18 or 19. 

22. An audio and/or video generation apparatus which 
is arranged in operation to generate audio and/or 
video signals representative of an audio and/or vis- 
ual source, said audio and/or video generation ap- 
paratus comprising 

a recording means which is arranged in opera- 
tion to record said audio and/or video signals 
on a recording medium, wherein 
said audio and/or video generation apparatus 
is arranged to receive meta data associated 
with said audio and/or video signals generated 
by a data processor, said recording means be- 
ing arranged in operation to record said meta 
' data on said recording medium with said audio 
and/or video signals. 

23. An audio and/or video generation apparatus as 
claimed in Claim 22, comprising an interface having 
a predetermined format for connecting said data 
processorto said audio and/orvideo generation ap- 
paratus, whereby said generation apparatus is ar- 
ranged to receive said meta data. 

24. An audio and/or video generation apparatus as 
claimed in Claims 22 or 23, wherein said meta data 
is user generated using said data processor. 

25. An audio and/or video generation apparatus as 
claimed in Claims 22, 23 or 24, comprising a meta 



data generator coupled to said recording means, 
said meta data generator- being arranged in opera- 
tion to generate secondary meta data associated 
with said audio and/or video signals, wherein said 
5 meta data and said secondary meta data are ar- 

ranged to be recorded in combination on said re- 
cording medium. 

26. An audio and/or video generation apparatus as 
10 claimed in Claim 25, wherein said secondary meta 

data is semantic meta data representative of oper- 
ating parameters of said audio and/or video gener- 
ation apparatus when said audio and/orvideo audio 
and/or video signals are generated. 

15 

27. An audio and/or video generation apparatus as 
claimed in any of Claims 24 to 26 , wherein said user 
generated meta data is text information. 

20 28. An audio and/or video generation apparatus as 
claimed in any of Claims 22 to 27, wherein said data 
processor has a speech conversion processor 
which is arranged in operation to generate said text 
information by converting sound signals represent- 

25 ative of spoken communication by said user. 

29. An audio and/or video generation apparatus as 
claimed in Claims 27 or 28, wherein said data proc- 
essor is a personal computer. 

30 

30. An audio and/or video generation apparatus as 
claimed in any of Claims 22 to 28, wherein said data 
processor is a personal digital assistant. 

35 31. An audio and/or video generation apparatus as 
claimed in any of Claims 22 to 30, wherein said au- 
dio and/or video generation apparatus is a video 
camera, television camera or a camcorder or the 
like. 



40 
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32. An audio and/or video generation apparatus as 
claimed in any of Claims 22 to 31 , wherein said re- 
cording medium is a linear recording medium, and 
said recording means is arranged in operation to 
record at least one of said user generated meta data 
and said secondary meta data on said recording 
medium after said audio and/or video signals at a 
position at which said at least one of user generated 
and secondary meta data can be read by a repro- 
ducing means before said audio and/or video sig- 
nals. 

33. A method of recording audio and/or video signals 
comprising the steps of 

a recording said audio and/or video signals on 
a recording medium, 

generating automatically meta data in re- 



10 



BNSOCCID: <EP 1130594A1J_> 



19 



EP1 130 594 A1 



20 



sponse to said audio and/or video signals, 
providing user generated meta data associated 
with said audio and/or video signals, and 
recording said user generated and said auto- 
matically generated meta data on said record- 
ing medium with said audio and/or video sig- 
nals. 

34. A method as claimed in Claim 33, wherein said user 
generated meta data is provided from a data proc- 
essor. 

35. A method as claimed in Claim 33 or 34, wherein said 
automatically generated meta data is semantic me- 
ta data representative of operating parameters of 
said audio and/or video generation apparatus when 
said audio and/or video signals are generated. 

36. A method as claimed in any of Claims 34 to 36, 
wherein said user generated meta data is text infor- 
mation. 

37. A method as claimed in any of Claims 33 to 36, 
wherein said recording medium is a linear recording 
medium, said method comprising the step of . 

recording at least one of said user generated 
meta data and said automatically generated 
meta data on said recording medium after said 
audio and/or video signals at a position at which 
said at least one of said user generated and 
said automatically generated meta data can be 
read by a reproducing means before said audio 
and/or video signals. 

38. A computer program providing computer executa- 
ble instructions, which when loaded on to a compu- 
ter configures said computer to operate as an audio 
and/or video generation apparatus as claimed in 
any of Claims 22 to 32. 

39. A computer program having computer executable 
instructions, which when loaded on to a computer 
causes the computer to perform the method accord- 
ing to any of Claims 33 to 37. 

40. A computer program product having a computer 
readable medium having recorded thereon informa- 
tion signals representative of the computer program 
Claimed in any of Claims 38 or 39. 

41. An audio and/or video generation apparatus which 
is arranged in operation to generate audio and/or 
video signals representative of an audio and/or vis- 
ual source, said audio and/or video generation ap- 
paratus comprising 

a recording means which is arranged in opera- 



tion to record said audio and/or video signals 
on a recording mediumrwherein said recording 
medium is a linear recording medium, and said 
recording means is arrange in operation 

5 to record meta data associated with said audio 

and/or video signals on said recording medium 
after said audio and/or video signals at a posi- 
tion at which said meta data can be read by a 
reproducing means before said audio and/or 

10 video signals. 

42. An audio and/or video generation apparatus as 
claimed in Claim 41, comprising a meta data gen- 
eration tool which is arranged in operation to gen- 

15 erate said meta data associated with said audio 
and/or video signals. 

43. An audio and/or video generation apparatus as 
claimed in Claims 41 or 42, comprising a receiving 

20 means for receiving said meta data, wherein said 
meta data is user generated. 

44. An audio and/or video generation apparatus as 
claimed in any of Claims 41 to 43, wherein said lin- 

25 ear recording apparatus is a magnetic tape. 

45. A method of recording audio and/or video signals 
representative on to a linear recording medium, 
said method comprising the steps of 

30 

recording said audio and/or video signals on to 
said linear recording medium, 
recording meta data associated with said audio 
and/or video on to said linear recording medium 
35 after said audio and/or video signals at a posi- 

tion at which said meta data can be read by a 
reproducing means before said audio and/or 
video signals. 

40 46. A computer program providing computer executa- 
ble instructions, which when loaded on to a compu- 
ter configures said computer to operate as an audio 
and/or video generation apparatus as claimed in 
any of Claims 41 to 44. 

45 

47. A computer program having computer executable 
instructions, which when loaded on to a computer 
causes the computer to perform the method accord- 
ing to Claim 45. 

50 

48. A computer program product having a computer 
readable medium having recorded thereon informa- 
tion signals representative of the computer program 
Claimed in any of Claims 46 or 47. 

55 
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